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ABSTRACT 


A webcast system delivers Web content from a webcast 
center over a broadcast medium to many clients. The web- 
cast center has a server unit that gathers Web pages from 
sites on the Internet and stores the pages in a cache. The 
server unit bundles the pages from the cache into package 
files and stores the package files in a package store. The 
webcast center also has a broadcast unit that retrieves the 
package files from the package store and delivers the pack- 
age files to the clients over the broadcast medium. Each 
client is equipped with a receiver to receive the broadcast 
package files. The cHent maintains a subscription database to 
store a directory of the Web content gathered by the webcast 
center. A subscriber user interface enables a user to select 
preferred Web content from the directory of the subscription 
database. ITie client creates a filter based on the user's 
preferences which is used to direct the receiver to collect 
only the package files carrying the preferred Web content, 
while rejecting packages carrying unwanted Web content. 

34 Claims, 5 Drawing Sheets 
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SYSTEM AND METHOD FOR DELIVERING 
WEB CONTENT OVER A BROADCAST 
MEDIUM 

TECHNICAL FIELD 

This invention relates to systems and methods for deliv- 
ering Web related information over a broadcast medium. 
This invention further relates to computer devices and 
software used to implement aspects of the systems and 
methods. 

BACKGROUND OF THE INVENTION 

Public networks, and most notably the Internet, are 
emerging as a primary conduit for communications, 
entertainment, and business services. The Internet is a net- 
work formed by the cooperative interconnection of comput- 
ing networks, including local and wide area networks. It 
interconnects computers from around the world with exist- 
ing and even incompatible technologies by employing com- 
mon protocols that smoothly integrate the individual and 
diverse components. 

The Internet has recently been popularized by the over- 
whelming and rapid success of the World Wide Web (WWW 
or Web). The Web links together various topics in a complex, 
non-sequential web of associations which permit a user to 
browse from one topic to another, regardless of the presented 
order of topics. The Web is rapidly evolving as a standard for 
distributing, finding, and accessing information of any type. 
A "Web browser" is an application that executes on the 
user's computer to navigate the Web. The Web browser 
allows a user to retrieve and render hypermedia content from 
the WWW, including text, sound, images, video, and other 
data. 

The amazing growth rate in the demand for data over the 
Internet is partly due to an increasing audience. The World 
Wide Web has crossed the threshold that makes it affordable 
and interesting to a much larger audience. There is infor- 
mation available on a very wide variety of topics, and tools 
exist to help people find and view the information cost 
effectively. Another factor fueling the Internet growth is the 
increasing data demands per individual user. There is a trend 
for web sites to evolve from using pure text to richer media, 
such as pictures, sound, and video. Adding these richer 
media is popular because they present information more 
clearly, thereby enhancing a site's impact and popularity. 

Unfortunately, a problem facing the continued growth and 
acceptance of the Internet is that conventional methods for 
accessing the Web do not scale well to meet the rapid growth 
in demand. The quality of service for the Web is intuitively 
measured by the user as the amount of time between 
requesting a Web page and being able to view it. Internet 
users have been conditioned through their experiences with 
television and standalone multimedia applications to expect 
instantaneous results on demand. Users are accustomed to 
changing the channel and instantaneously viewing the video 
content for that channel on the screen. The Internet is unable, 
however, to deliver data instantaneously. For the most part, 
the Internet has significant latency problems that reduce 
fairly routine Web browsing exercises to protracted lessons 
in patience. 

The basic dilemma is that the quality of service degrades 
as more people try to use the Web. More unsetding is the 
corollary that service for popular Web sites is typically much 
worse than service for unpopular sites. At the root of the 
service problem is the inability to serve Web data rapidly as 
a result of too little bandwidth in the distribution network. 
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"Bandwidth" is the amount of data that can be moved 
through a particular network segment at any one time. The 
Internet is a conglomerate of different technologies with 
different associated bandwidths. Distribution over the Inter- 
net is usually constrained by the segment with the lowest 
available bandwidth. 

Consider the Internet system 20 shown in FIG. 1. The 
Internet system 20 includes a Web server 22 that stores and 
serves data over the Internet 24 to regional point of presence 
(POP) operators or independent service providers (ISPs), as 
represented by ISP 26. The ISP 26 provides connectivity to 
the Internet 24 to many users, as represented by subscriber 
computers 28, 30, and 32. 

The ISP 26 is connected to the Internet 24 via a network 
connection 34. In this example illustration, the network 
connection 34 is a "Tl" connection. "Tl" is a unit of 
bandwidth having a base throughput speed of approximately 
1.5 Mbps (Megabits per second). Another common high 
bandwidth connection is a T3 connection, which has a base 
throughput speed of approximately 44.7 Mbps. For purposes 
of explaining the state of the technology and the practical 
problems of delivering content over the Internet, it is suflS- 
cient to understand that there is also a limited bandwidth 
connection between the Web server 22 and the Internet 24. 

The subscriber computers 28, 30, and 32 are connected to 
their host ISP 26 via home entry lines, such as telephone or 
cable lines, and compatible modems. As examples of com- 
mercially available technology, subscriber computer 28 is 
connected to ISP 26 over a 14.4K connection 36, which 
consists of a standard telephone line and a V.32bis modem, 
to enable a maximum data rale of 14,4 Kbps (Kilobits per 
second). Subscriber computer 30 is connected to the ISP 26 
with a 28. 8K connection 38 (telephone line and V.34 
modem) which supports a data rale of 28.8 Kbps. Subscriber 
computer 32 is connected to the ISP 26 with an ISDN 
connection 40, which is a special type of telephone line that 
facihtates data flow in the range of 128-132 Kbps, Table 1 
summarizes connection technologies that are available 
today. 

TABLE 1 


Connection Technologies 

and Throuehout 

Connection Type 

Base Speed (Kbps) 

V.32bis modem 

14.4 

V.34 modem 

28.8 

56K Leased Line 

56 

ISDN BRI (1 channel) 

56-64 

ISDN BRI (2 channels) 

128-132 

Frame Relay 

56-1,544 

Fractional Tl 

256-1,280 

ISDN PRI 

1,544 

Full Tl (24 channels) 

1,544 

ADSL 

2,000-6,000 

Cable Modem 

27,000 

T3 

44,736 


With a Tl connection to the primary distribution network 
24, the ISP 26 can facihtate a maximum data flow of 
approximately 1.5 Mbps. This bandwidth is available to 
serve all of the subscribers of the ISP. When subscriber 
computer 28 is connected and downloading data files, it 
requires a 14.4 Kbps slice of the 1.5 Mbps bandwidth. 
Subscriber computers 30 and 32 consume 28.8 Kbps and 
128 Kbps slices, respectively, of the available bandwidth. 

The ISP 26 can accommodate simultaneous requests from 
a number of subscribers. As more subscribers utilize the ISP 
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services, however, there is less available bandwidth to lecture that scales to meet rising demand. There remains a 

satisfy the subscriber requests. If too many requests are need to develop improved techniques for facilitating distri- 

received, the ISP 26 becomes overburdened and may not be bution of Web content over the Internet, 
able to adequately service the requests in a timely manner, 

causing frustration to the subscribers. If latency problems 5 SUMMARY OF THE INVENTION 

persist, the ISP can purchase more bandwidth by adding ^^^^ invention concerns a system for delivering Web 

additional capacity (e.g., upgrading to a T3 connection or ^^^^^^^ ^^^^ ^ broadcast medium from a webcast center to 

adding rnore Tl connections). Unfortunate y, adding more ^^^^^ ^^^^^^^ ^^^^^^ ^ ^^^^ t^^^ 

bandwidth may not be economically wise for the ISP. The / ^ , ^ .u 1 * . a u a * 

load placed on the ISP lypicaUy fluctuates throughout dif- ^^^^v uVT °1 ""f '° .""f * ""J' 

ferent timcsof theday. AddingexpensivebandwidS^tomorc '° '^'^'"^'^ "^^^ '° ^^^''^ '^^ 
readily service short duration high-demand times may not be 


cast medium. 


profitable if the present capacity adequately services the The server includes a gatherer to continuously gather Web 

subscriber traffic during most of the day. content, lypicaUy in the form of Web pages, from selected 

The latency problems are perhaps most pronounced when ^ites. A scheduler tells the gatherer which sites, and what 

working with video. There are few things more frustraUng to ^'"^^^^ g^^^er the Web content. Preferably, the scheduler 

a user than trying to download video over the Internet. The gathering tmies during off-peak hours at the sites. The 

problem is that video requires large bandwidth in compari- scheduler mainUins a schedule database of desired Web sites 

son to text files, graphics, and pictures. Additionally, unlike ^0°^^°^ ^P0° preferences entered by an admmis- 

still images or text files, video is presented as moving trator at the webcast center, pe gatherer fetches the content 
images that are played continuously without interruption. '° and stores it m a content cache to maintain a current copy of 

Video typicaUy requires a 1,2 Mbps for real-time streaming "^^^ ^he webcast center, 

data. This 1.2 Mbps throughput requirement consumes The gatherer is configurable to gather from each site a 

nearly all of a Tl bandwidth (1.5 Mbps). Accordingly, when home Web page at a root URL (Universal Resource Locator) 

multiple subscribers are coupled to the ISP and one sub- and any additional Web pages within a predefined depth 

scriber requests a video file, there is generally not enough below the root URL. The administrator sets the desired 

capacity to stream the video in real-time from the Web server depth for each site. The gatherer also collects any in-line 

22 over the Internet 24 to the requesting subscriber. Instead, image files referenced by the gathered Web pages, 

the video file is typically delivered in its entirety and only The webcast server has a packager to retrieve the Web 

then played on the subscriber computer. Unfortunately, even content from the content cache and package the Web content 

downloading video files in the block data format is often into package files. The packager stores the package files in 

inconvenient and usually requires an excessive amount of a package store which is separate from the content cache, 

time. The packages include data from the Web content and other 

Consider the following example. Suppose a subscriber information provided by the server, such as the size and 
wishes to access a Web site having a 20-second video clip. 35 modification time. 

At 1.2 Mbps, the 20-second video clip involves download- The broadcast unit takes the packages files from the 

ing a 24 Mbyte file over the Internet. If the user has a modest package store, segments them into individual packages, and 

14.4 Kbps connection, it would take approximately twenty- transmits the packages over the broadcast medium, 

eight minutes to download the entire file. Preferably, the broadcast unit employs a broadcast transmit- 
Now, assume that the subscriber/ISP connection is suffi- 40 ler configured as a fault tolerant broadcast file transfer 

ciently large to handle real-time video streaming of the video system. The broadcast medium may be any medium that 

file, meaning that the subscriber computer can render the supports multicast package transports. Possible transports 

video data as it is received from the ISP. Despite the include local area Ethernet networks (LANs), and encoding 

bandwidth of the subscriber/ISP connection, real-time video onto digital satellite or broadcast television signals, 
streaming may still be unachievable if the Tl connection 34 45 Each client is equipped with a receiver to receive the 

between the ISP 26 and the distribution network 24 is broadcast packages. The client maintains a subscription 

unable, or unwilling due to policy reasons, to dedicate 1.2 database to store a directory of the Web content gathered by 

Mbps of its bandwidth to the video file. Requests for the the webcast center. A subscriber user interface enables a user 

20-second video clip made during peak traffic times at the to select preferred Web content from the directory of the 
ISP most certainly could not be accommodated by the 50 subscription database. The client creates a filter based on the 

ISP/network connection. Since adding more bandwidth may user's preferences. The filter directs the receiver to collect 

be a poor investment for the ISP, the ISP may have no only the preferred Web content, while ignoring packages 

economic incentive to remedy the latency problem. The carrying unwanted Web content, 

result is that some users might be inconvenienced by the lack As the preferred Web content is received, the client 
of ability to receive streaming video despite their own 55 reconstructs the package files and temporarily stores them in 
connection to the ISP being capable of accommodating a package store. An unpackager reconstructs the Web con- 
streaming video. tent from the package files in the package store. The unpack- 
The latency problem is further aggravated if the connec- ager is configured to determine whether the Web content 
tion between the content server 22 and the distribution received in the broadcast packages is more recent than the 
network 24 is equally taxed. The lack of sufficient band- go same Web content that the user might have collected on 
width at the content server/network link could also prevent his^er own from the same site. If the broadcast content is a 
real-time video streaming over the Internet, regardless of the more recent copy, the client retains that version; otherwise, 
bandwidths of the network/ISP link or the ISP/subscriber the client discards the broadcast package files in favor of the 
link. If all links lack sufficient bandwidth, the latency more recent version. 

problem can be compounded. 55 The client annotates any hyperlinks contained in the Web 

Accordingly, traditional techniques of adding more band- pages. The annotations differentiate among links that have 

width at each connection do not offer an acceptable archi- been actuated, links that go to content stored locally at the 
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client as a result of the broadcast transmission, and links that 
go to content stored remotely from the client. The annotation 
may be in the form of color variations, or stylistic changes. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a diagrammatic illustration of a prior art Internet 
system. 

FIG. 2 is a diagrammatic illustration of a webcasting 
system for delivering Web content from a webcast center 
over a broadcast medium to multiple clients. 

FIG. 3 is a functional block diagram of the webcast center. 

FIG. 4 is a functional block diagram of a client. 

FIG. 5 is a diagrammatic illustration of a subscription user 
interface implemented at the client to enable a user to select 
preferred Web content to be delivered over the broadcast 
medium. 

FIG. 6 is a diagrammatic illustration of the subscription 
user interface according to a second implementation. 

DETAILED DESCRIPTION 

FIG. 2 shows a webcast system 50 for delivering Web 
content from a webcast center 52 over a broadcast medium 
54 to multiple clients 56(1)-56(M). The webcast center 52 
gathers Web content from the World Wide Web by visiting 
web sites 58(1)-58(N) via the Internet 60 and fetching 
content from those sites. The Web content is typically in the 
form of Web pages found at the sites. A Web page is a title, 
collection of information, and pointers or "hyperlinks" to 
other information. A Web page may be constructed from 
various types of content including computer data, audio, 
video, animation, bit maps or other graphics, applications or 
other executable code, text, hypermedia, or other multime- 
dia types. 

The webcast center 52 collects Web pages from the 
Internet's World Wide Web 60 and stores them in a page 
cache 62. A system administrator sets a schedule that estab- 
lishes which sites are visited by the webcast center 52, the 
time and frequency of the visits, and the type' of content 
collected. 

Apart from the gathering process, the webcast center 52 
retrieves the pages from the page cache 62, bundles them 
into composite package files, and stores them in a package 
store 64. llie package store 64 is preferably a separate 
database than the page cache 62. The webcast center 52 
fetches the package files from the package store 64, seg- 
ments the package files into individual packages (or 
packets), and transmits the packages over the broadcast 
medium 54. 

The broadcast medium 54 is a unidirectional network in 
which packages are delivered from the webcast center 52 to 
the clients 56(1)-56(M) without requiring return communi- 
cation from the clients. The broadcast medium 54 can be 
characterized as a shared, highly asymmetrical, network 
resource with a limited, if not completely absent, low speed 
return path that does not need to be active to receive 
broadcast transmissions. The broadcast medium 54 may 
comprise the entire distribution network between the web- 
cast center and clients, or it may be a single link in a larger 
distribution network. 

The broadcast medium 54 may be implemented in a 
variety of ways. The broadcast medium 54 might be 
implemented, for example, as a wireless network configured 
for one-way transmission (i.e., satellite, radio, microwave, 
etc.). The broadcast medium 54 might also be configured as 
a network that supports two-way communication (i.e., 
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Internet, LAN (local area network), and WAN (wide area 
network)), but can be used for unidirectional multicasting 
from the webcast center to the clients. 

The clients 56(1)-56(M) represent various types of con- 

5 structions. The clients can be implemented as essentially any 
type of computing device that can receive and reconstruct 
data packages, and render the packages on a display. As one 
possible implementation, the client may be constructed as a 
desktop computer, as represented clients 56(1) and 56(2), 
that are specially configured with software/hardware com- 
ponents described below with respect to FIG. 4. Client 56(1) 
receives broadcast Web content from the broadcast medium 
54 via an Independent Service Provider (ISP) 66, rather than 
receiving the broadcasts directly. On the other hand, client 
56(2) is a broadcast-enabled personal computer that is 
capable of receiving the broadcast packets directly. 

One example implementation of a broadcast-enabled PC 
is described in a co-pending U.S. patent application Ser. No. 
08/653,663, filed Jan. 29, 1996, which is a continuation of 

2Q U.S. patent application Ser. No. 08/503,055, entitled 
"Broadcast-Enabled Personal Computer," filed Jan. 29, 
1996, which is now abandoned. These applications were 
filed in the names of Gabe L. Newell, Dan Newell, Steven 
J. Fluegel, David S. Byrne, Whitney McCleary, James O. 

25 Robarts, Brian K. Moran; William B. McCormick, T. K. 
Backman, Kenneth J. Birdwell, Joseph S. Robinson, Alonzo 
Gariepy, Marc W. Whitman, and Larry Brader. lliis appli- 
cation is assigned to Microsoft Corporation, and is incor- 
porated herein by reference. 

3Q Another implementation of a client is a Web-enabled 
television, as represented by client 56(3), which has a set-top 
box or internal computing unit that permits receipt and 
rendering of Web content. In addition to desktop computers 
and Web-enabled TVs, other possible clients include 

35 workstations, laptop computers, palmtop computers, net- 
work computers, and the like. 

According to an aspect of this invention, another distri- 
bution entity may act as a "client" to the webcast center 52. 
As shown in FIG. 2, the regional Independent Service 

40 Provider (ISP) 66 might be a subscriber to the broadcast 
transmissions received over the broadcast medium 54 from 
the webcast center 52. The ISP 66 stores the webcast content 
and distributes it to its own clientele, such as client 56(1), 
using conventional distribution techniques. 

45 As another example of an intermediary distribution entity, 
a secondary webcast center 68 may function as a "client** to 
the primary webcast center 52. In addition to its own 
independent gathering process, the secondary webcast cen- 
ter 68 also receives and re-broadcasts the Web content 

50 received from the primary webcast center 52 to a set of 
clients 56(4)-56(M) over a broadcast medium 70. One 
implementation of this dual webcast center architecture is 
that the primary webcast center 52 is a primary head end that 
distributes nationally or globally via satellites, and the 

55 secondary webcast center 54 is a regional distributor that 
distributes the Web content via RF (radio frequency) or 
microwave transmission. 

The webcast system 50 advantageously distributes Web 
content to many clients (potentially millions) without bur- 

60 dening the limited bandwidth of the Internet. Broadcasting 
the Web pages over a one-to-many broadcast medium avoids 
the bottlenecks caused by overburdened ISP connections or 
slow client modem connections. Since the ISP might be a 
client of the broadcast medium, the broadcast alternative 

65 offers additional bandwidth at a fraction of the cost that 
would be incurred if the ISP installed additional Internet 
connections, such as Tl or T3 connections. 
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The webcast system 50 also addresses the server scalabil- tional pages. Hyperlinks are symbols or instructions describ- 
ity problem. Many clients can be added to the system to ing where to find other related Web pages on the host 
receive broadcast data at no incremental cost to the webcast computer, or on the Internet. A hyperlink from one hyper- 
center 52. The load on the webcast center 52 is constant media topic to another is normally established by the author 
because only it accesses the sites on the Internet, regardless s of the page and is typically rendered as a graphical icon or 
of the number of clients receiving the broadcast transmis- as highlighted keywords. The following is a simple example 
sions. of a portion of an HTML Web page containing a single 

Webcast Center hyperlink: 

FIG. 3 shows the architecture of the webcast center 52 in Microsoft has a Web page with the latest <A HREF- 

more detail. It generally comprises a server unit 80 and a jq "HTTP://www. microsoft.com/upgrades"> upgrades</ 

broadcast unit 82. The server unit 80 is responsible for A> to its popular word processing program, 

gathering Web pages from the Web 60 and converting the When rendered by a browser, the word "upgrades" 

pages to package files suitable for broadcast transmission. appears highlighted and/or underlined, and the text within 

The broadcast unit 82 is responsible for transmitting the the angled brackets does not appear at all, as follows: 

package files over the broadcast medium 54. Microsoft has a Web page with the latest upgrades to its 

A human administrator sets up a schedule database 84 that popular word processing program, 

contains a Usting or directory of Web content that the By clicking on the highlighted keyword "upgrades," the 

administrator desires to gather. The directory specifies a set user can instruct the Web browser to activate the underlying 

of webcast groups, which may represent content from a URL. in this case, the underlying URL is a subroot of the 

single site, or content from multiple sites. The directory 20 root URL, as follows: 

includes a name for each webcast group, the Web pages to http://www.microsoft.comAipgrades 

be included in each group, and time periods at which the The portion "/upgrades" attached to the root URL defines 

server unit 80 should fetch new updates for that group. a path and file name of the page on the remote host 

A scheduler 86 reads from the schedule database 84 and computer, or other server specified by the host computer, one 

establishes a schedule of sites and times for fetching content 25 depth level beneath the home page at the root URL. 

for specific Webcast groups. When the scheduler 86 deter- ITie administrator of the webcast center 52 establishes the 

mines that it is time to retrieve Web pages from a is depth at which the gatherer 88 is to collect pages. For a given 

particular site, the scheduler 86 invokes a gatherer 88 to group, the administrator sets a depth limit specifying the 

gather the Web pages from sites on the Internet 60. TTie number of levels beneath the home page that are to be 

gatherer 88 fetches a specified group according to the 30 collected by the gatherer 88. For example, the administrator 

specifications dictated by the scheduler 86, as entered by the might prescribe a depth limit of three levels. The gatherer 88 

administrator into the scheduler database 84. collects the home page (at depth 0) and all subroots between 

For each group, the gatherer 88 visits one or more sites the home page and any URL having the following structure: 

holding the Web content for the group. The gatherer 88 http://root URL/level 1/level 2/level 3 

accesses a Web page at the root URL (Universal Resource 35 The gatherer 88 travels down the hyperlink paths to 

Locator) for each of the sites. The Web page at the root URL retrieve each page or content specified by the hyperlink. For 

is commonly referred to as the "home" Web page. A URL each page, the gatherer 88 is preferably configured to grab 

describes everything about a particular resource that is any in-line image files referenced by the Web page, 

needed to request the resource from a particular site. The As the pages are gathered, the gatherer 88 stores the pages 

URL describes the protocol the gatherer 88 uses to retrieve 40 in the page cache 62. One preferred optimization technique 

the resource, the name of the computer the resource is on, is to check the modification dates of the pages prior to 

and the path and file name of the resource. The following is retrieving and store them. The gatherer 88 checks these dates 

an example of a root URL: against the modification dates of the same pages that might 

http://www.microsoft.com abeady be held in the page cache 62. Pages that have not 

The "htlp://" portion of the URL describes the protocol. 45 changed since the last time they were gathered (i.e., those 

The letters "http" stand for HyperText Transfer Protocol, the pages having the same modification dates as stored pages) 

set of rules that the gatherer follows to request a document are not copied. Pages that have been updated since they were 

and the remote server will follow to supply the document. last gathered, however, are retrieved and stored in the page 

The "www.microsoft.com" portion of the URL is the name cache 62. In this manner, the gatherer 88 effectively updates 

of the remote host computer that maintains the document. In 50 the webcast groups maintained in the page cache 62 in an 

this example, a home Web page for Microsoft Corporation eflScient gathering process. 

is presented at the root URL. The gatherer 88 informs the scheduler 86 when its finishes 

Web pages are typically written in a "markup language," gathering all of the specified content in a group. The gatherer 

such as SGML (Standard Generalized Markup Language). 88 may then proceed to a next group according to the 

SGML is defined formally as a language for document 55 schedule laid out by the scheduler 86, If the schedule 

representation that formalizes markup and frees it of system dictates, the gatherer 88 may continuously gather pages for 

and processing dependencies. SGML is a language for webcast groups to routinely refresh the pages in the page 

describing the structure of documents and for describing a cache 62. 

tagging scheme to delineate that structure within text. More After the gatherer 88 has updated a group, the scheduler 

specifically, Web pages utilize a subset of SGML called 60 86 invokes a packager 90 to package the webcast group. The 

"HTML" (Hypertext Markup Language). An HTML docu- packager 90 queries the page cache 62 to determine what 

ment can be thought of as plain text that contains formatting pages are in the group, how large the pages are, and how 

instructions in the form of HTML markup codes or "tags." they should be partitioned into package files. In one 

The tags explain how to render and print documents, and are implementation, the individual pages in a group are parti- 

also used to specify hyperlinks. 65 tioned into package files according to the following rides: 

Home pages contain general data about the site and 1. Organize pages in alphabetical order according to their 

commonly provide hyperlinks (or simply "links") to addi- URLs. 
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2. Create a new package file for the first URL. 

3. Add one or more pages to the package file. If adding 
another page to the current package file causes the 
package file to exceed a maxunum size (as set in the 
server unit configuration) or causes the package file to 
contain pages from more than one site, close the 
package file and create a next new package file. 

For each set of URLs to be put in the same package file, the 
packager ^ retrieves the pages and information from the 
page cache 62, appends header information about the group, 
and stores the package file in the package store 64. When the 
packager 90 completes all package files for the group, it 
informs the scheduler 86 that the group is updated. 

At this point, the package files are essentially broadcast 
ready and only need to be packetized into discrete packets 
for broadcast transmission. The package files represent a 
more up-to-date version of the Web content for the webcast 
group. The broadcast unit 82 may now broadcast the web- 
cast group to provide the up-to-date version to the clients. 

It is noted that the scheduler 86, gatherer 88, and packager 
90 are preferably implemented in software that executes on 
one or more computers at the webcast center 52. These 
programs are stored in memory, such as the disk memory of 
the computers, and execute on processing units. As one 
possible example, the computers at the webcast center might 
be implemented as servers that run a server operating system 
such as Windows® NT from Microsoft Corporation, or a 
UNIX-based operating system. The scheduler 86, gatherer 
88, and packager 90 are software modules that run atop the 
operating system on the server. 

With continuing reference to FIG. 3, the broadcast unit 82 
takes the package files in the package store 64, converts 
them to transmittable packets, and broadcasts the packets 
over the broadcast medium 54. The broadcast unit 82 is is 
preferably implemented at a broadcast head end (e.g., sat- 
ellite uplink, cable head end, TV broadcast station, etc.), 
which may or may not comprise the same group of server 
computers of the server unit 80. It may be desirable to 
physically separate the broadcast unit 82 from the server unit 
80. The server unit 80 involves human administration in 
editing the scheduler database 84 in response to changes in 
the webcast groups, the schedule, client preferences, and the 
web sites. It may not be feasible to locate people at the 
webcast head end. 

With respect to the broadcast unit 82, an administrator sets 
up a transfer schedule database 92 that contains a listing of 
when or at what frequency various webcast groups are to be 
transmitted over the broadcast medium 54 to the clients. The 
administrator can edit the database 92 remotely, or transfer 
the information over the network from the server unit 80 to 
the database 92, if the broadcast unit 82 is physically remote 
from the server unit 80. A store and forward scheduler 94 
reads the database 92 to determine when sets of package files 
pertaining to scheduled webcast groups are to be transmitted 
over the broadcast medium 54. 

Transmission of a package file is a two-stage process. The 
first stage is to announce a time that the package or set of 
packages pertaining to a webcast group will be transmitted. 
The second stage is to send the packages themselves at the 
announced time. 

During the announcement stage, the store and forward 
scheduler 94 invokes an announcement generator 96 to 
create announcements of upcoming broadcast transmissions. 
The announcements contain many of the details pertaining 
to delivery and receipt of the broadcast files and include 
sufficient information for the client to decide whether to 
receive the packages. The announcements might contain, for 
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example, the name of the webcast group to be transmitted, 
an address (or frequency) and time of the transmission, the 
size of the packages, the range of URLs for the group, 
whether to place the packages in the chent's browser cache 
5 or file system, and so forth. The announcements are typically 
orders of magnitude smaller than the package files them- 
selves. 

The annoimcements are sent on a well-known address to 
which all clients listen. The announcements are preferably 

10 sent using Session Announcement Protocol (SAP), a proto- 
col typically used to announce multicast data over networks. 
The SAP protocol itself is well known, and is described in 
M. Handley "SAP: Session Announcement Protocol", 
INTERNET-DRAFT, draft-ietf-mmusic-sap-OO.txl, Nov. 27, 

15 1996. 

Various announcement techniques may be used with this 
invention. One example of a suitable announcement tech- 
nique is described in a co-pending U.S. patent application 
entitled, "Transmission Announcement System And Method 

20 For Announcing Upcoming Data Transmissions Over A 
Broadcast Network," filed in the names of Kenneth J. 
Birdwell, Brian Moran, Randy Sargent, Carl Witty, and 
David S. Byrne. This application is assigned to Microsoft 
Corporation and is incorporated by reference. 

25 After a sufficient delay to allow the clients time to receive 
the announcements and decide whether to receive the Web 
content packages, the store and forward scheduler 94 
invokes a broadcast transmitter 98 to transmit the packages. 
The broadcast transmitter 98 retrieves the package files from 

30 the package store 64 and segments the package files into 
transmittable packets or packages. The size and configura- 
tion of the packages differ depending upon the broadcast 
mediima and transport being used. For example, a broadcast 
transmitter embodied for satellite transmission will create 

35 digital packets of one size and configuration, whereas a 
broadcast transmitter embodied for LAN transmission will 
create Ethernet packets of a different size and configuration. 
As another example, the broadcast transmitter might con- 
figure the packages to be carried as part of a VBI (vertical 

40 blanking interval) signal in conventional television broad- 
casts or as part of a cable TV transmission. The broadcast 
transmitter 98 transmits the packages to the address (or 
frequency) and (as near as possible) at the time listed in the 
announcement. 

45 The broadcast transmitter 98 preferably implements a 
robust one-way file transfer protocol that can recover from 
a certain amount of data loss through the broadcast medium. 
One example of an effective protocol is a broadcast file 
transfer protocol (BFTP), which is explained in greater 

50 detail in a co-pending U.S. patent application Ser. No. 
08/871,657, entitled "Data Delivery System And Method 
For Delivering Data And Redundant Information Over A 
Unidirectional Network," filed Jun. 9, 1997 in the names of 
Carl Witty, Kenneth Birdwell, and Randy Sargent, now U.S. 

55 Pat. No. 6,081,907. This application is assigned to Microsoft 
Corporation and incorporated herein by reference. 

It is noted that the scheduler 94 and aspects of the 
announcement generator 96 and broadcast transmitter 98 are 
preferably implemented in software that executes on one or 

60 more computers at the broadcast unit 82. These programs are 
stored in memory, such as the disk memory of the 
computers, and execute on processing units. 

The webcast center architecture offers many benefits. One 
benefit is that the Web gathering tasks performed by the 

65 server unit 80 are decoupled from the broadcast tasks of the 
broadcast unit 82. This allows the units to be physically 
separated and operated independently of one another. 
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Another advantage is that it promotes flexibility by allowing 
a more reliable and/or higher bandwidth link between the 
package store 64 and the broadcast transmitter 98 in com- 
parison to the link between the Web 60 and the package store 
64. This allows data to continue flowing to clients even if the 5 
connection between the Web 60 and the server unit 80 goes 
down. 

Another benefit is that the architecture lakes advantage of 
fill broadcast bandwidth, even if the connection between the 
server unit 80 and the broadcast unit 82 transfers less data lO 
than can be broadcast. The broadcast center 82 continuously 
cycles through the package files in the package store 64 and 
broadcasts them multiple times between updates from the 
server unit 80. Cycling improves reliability in case of major 
losses in the broadcast stream and in case of the client not 15 
being available to receive the transmissions. 

Still another benefit of the architecture is the dual cache 
arrangement. Pages pulled from the Web are initially stored 
in the page cache 92. The pages are then subsequently 
bundled and stored as package files in the package store 64, 20 
This dual cache arrangement permits the gatherer 88 to 
gather pages at one data rate (e.g., modem or Tl connection 
speeds), and the packager 90 to update the files in the 
package store 64 at a different data rate. Moreover, the 
broadcast transmitter 98 can then retrieve and broadcast the 25 
package files at a third data rate that is difl:erent, and 
typically much larger, than the first data rate at which the 
gatherer collects Web pages. Thus, the gathering tasks and 
performance are independent of the broadcasting tasks and 
performance. 30 

aient 

FIG. 4 shows the architecture of a client 56 that partici- 
pates in the webcast system. The client 56 is equipped with 
appropriate hardware to receive the broadcast transmission 
packages containing the Web content. For broadcast-enabled 35 
PCs, or Web-enabled televisions, this hardware might com- 
prise a satellite dish that receives the satellite broadcasts 
directly, or a cable box or antenna that receives the digital 
packages as part of a cable transmission or VBI signal. For 
clients connected to an ISP, the hardware might comprise a 40 
modem to receive the packages over a telephone line from 
the ISP, who in turn received the packages from the broad- 
cast medium. For clients connected to a LAN, the hardware 
might comprise a network card or the like to enable recep- 
tion of the broadcast packets over the network. 45 

The client 56 has an announcement listener 100 tuned to 
the announcement address to receive the announcements 
from the webcast center. The announcements received at the 
listener announce future webcast groups that are to be 
delivered over the broadcast medium 54. so 

The client 56 enables the user to select which webcast 
groups to receive. The webcast center occasionally down- 
loads the directory of Web content maintained in the sched- 
uler database 84. The directory is stored in a subscription 
database 102 at the client 56. The subscription database 102 55 
thus holds the Web content offered by the webcast center, 
including the names of the webcast groups, the URL range 
for the pages within the groups, and so forth. 

The client supports a subscription graphical user interface 
(UI) 104, which executes on the client processor, to permit 60 
the user to view the directory in the scheduler database 102. 
FIG. 5 shows subscription UI 104 according to one possible 
graphical layout. The UI 104 presents the list of available 
content, and enables the user to select one or more of the 
webcast groups. In this example, the UI 104 shows possible 65 
webcast groups of CNBC, ESPN, MSNBC, and the Smith- 
sonian Instimte. The directory on UI 104 also indicates the 
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URL depth level, as listed in the parenthetical trailing the 
group name. The user can select one or more groups by 
checking the box in front of the name, as shown by the 
checked MSNBC box. The subscription UI 104 may also 
display other types of information, such as the cost of a 
subscription, the subscription term, and frequency of 
distribution, and so forth. 

FIG. 6 shows a subscription UI 104' according to another 
possible screen layout. In FIG. 6, the UI 104' presents a set 
of categories for the user to select according to their pref- 
erences. In this example, the UI 104' shows categories of 
literature, music, news, and sports. The user selects various 
categories, which are then translated to one or more webcast 
group in the subscription database 102 that comply with the 
category selections. 

The subscription user interfaces shown in FIGS. 5 and 6 
are provided for example purposes. Many other types of 
interfaces may be used, with many different ways to enter 
user preferences or likes. 

Based upon the user preferences, the client 56 creates a 
filter 106 to accept the preferred Web content broadcast from 
the webcast center, while rejecting the non-preferred con- 
tent. In the FIG. 4 implementation, the filter 106 is config- 
ured to receive announcements from the announcement 
listener 100 and based on the client preferences in the 
subscription database 102, decide whether the client should 
accept or reject the corresponding upcoming webcast group. 
The filter 106 might examine the group name, the URL 
range, and other information contained in the announcement 
to aid in the decision process. Depending on the decision, the 
filter 106 controls a receiver 108 to receive or ignore the 
associated content packages from the broadcast medium 54. 

If the announcement is for a desired group, such as pages 
from the MSNBC site, the filter 106 invokes the receiver 108 
and tells it which port to receive the incoming packages. The 
filter 106 also teUs the receiver 108 what to name the file for 
storing the incoming packages based on the announcement. 
The receiver 108 interprets the protocol for delivering the 
packages and reconstructs the package file. The receiver 
stores the package file in a temporary package store 110 on 
the client's hard drive. When the package file is complete, 
the receiver 108 informs the filter 106 of whether the transfer 
is successful. If the transfer fails, the incomplete package file 
is deleted and the event is logged. 

If the package file arrives intact, the filter 108 invokes an 
unpackager 112 to reconstruct the Web pages from the 
package file. The unpackager 112 may also be configured to 
determine whether the Web pages received from the broad- 
cast medium 54 are more recent than the same Web pages 
that may already be stored at the client. In one possible 
situation, the user might have visited a Web site indepen- 
dently of the webcast transmission and downloaded pages 
from the Web site that are more recent than the version being 
broadcast by the webcast center. In this case, the client does 
not overwrite the more recent version, but instead disregards 
the Web content unpackaged by the unpackager 112. Assum- 
ing the pages received from the webcast center are more 
recent, the unpackager 112 stores the Web pages in a 
browser cache or file system 114. A browser (not shown) can 
then access the pages and render them on the client. An 
example of a suitable browser is the Internet Explorer 
browser sold by Microsoft Corporation. 

An annotator 116 may be invoked to annotate the hyper- 
links contained within the Web pages. The annotator 116 
differentiates among links that have been actuated. Links that 
go to content stored locally in the file system or cache 114, 
and links that go to content stored remotely from the client. 
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The annotation may be to use different colors to set apart the 
three sets of links, or to use different font or format styles. 
Annotated links make it more convenient for the user to 
quickly discern whether the content is available locally, and 
hence ready for immediate access, or whether he/she must 
access the content over the Internet, which may involve 
some delay. 

The announcement listener 100, subscription UI 104, 
filter 106, unpackager 112, and annotator 116 are preferably 
implemented in software that executes on the client 
computer, set-top box, or television. These programs are 
stored in memory, such as disk memory or ROM (Read Only 
Memory), and execute on a processor. As one possible 
example, the client is a personal computer that runs a 
graphical windows-based operating system such as Win- 
dows® 95 from Microsoft Corporation. As another example, 
the client is a Web-enabled television that runs an operating 
system that is a derivative of the Windows(S)-brand operating 
systems. 

Although the invention has been described in language 
specific to structural features and/or methodological steps, it 
is to be understood that the invention defined in the 
appended claims is not necessarily limited to the specific 
features or steps described. Rather, the specific features and 
steps are disclosed as preferred forms of implementing the 
claimed invention. 

What is claimed is: 

1. A system for delivering Web content over a broadcast 
medium, comprising: 

a webcast center to actively gather Web content from sites 
on the Internet by visiting the sites and fetching content 
from the sites, the webcast center storing the gathered 
Web content, the webcast center having a broadcast 
transmitter to transmit the gathered Web content over 
the broadcast medium; and 

multiple clients equipped with filters to receive a portion 
of the gathered Web content from one or more selected 
sites; and 

wherein the Web content comprises Web pages, and a 
packager is configured to organize the pages in alpha- 
betical order according to their URLs (Universal 
Resource Locator) and to form package files that con- 
tain pages from same sites. 

2. A system as recited in claim 1, wherein the webcast 
center is configured to continuously gather the Web content, 

3. A system is recited in claim 1, wherein the webcast 
center actively gathers the Web content by retrieving a Web 
page at a root URL (Universal Resource Locator) and any 
additional Web pages within a Predetermined depth below 
the root URL. 

4. A system as recited in claim 1, wherein the webcast 
center gathers Web pages and any in-line image files refer- 
enced by the Web pages. 

5. A system as recited in claim 1, wherein the packager 
packages the Web content in the content cache into broad- 
cast package files, and wherein the webcast center further 
comprises: 

a content cache to store the Web content gathered from the 
sites; and 

a package store to store the broadcast package files prior 
to broadcast transmission. 

6. A system has recited in claim 1, wherein the webcast 
center has a scheduler to schedule the sites and times to 
actively gather the Web content from the Internet. 

7. A system as recited in claim 1, wherein each client has 
a subscription database to store a user*s Web content 
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preferences, the subscription database being utilized by the 
client's filter to selectively receive the preferred Web con- 
tent. 

8. A system as recited in claim 1, further comprising a 
5 content server configured to receive the broadcast Web 

content and to serve the Web content to one or more of the 
clients. 

9. A system for delivering Web content over a broadcast 
medium, comprising: 

10 a webcast center to actively gather Web content from sites 
on the Internet by visiting the sites and fetching content 
from the sites, the webcast center storing the gathered 
Web content, the webcast center having a broadcast 
transmitter to transmit the gathered Web content over 

15 the broadcast medium; 

multiple clients equipped with filters to receive a portion 
of the gathered Wet? content from one or more selected 
sites; and 
wherein: 

the broadcast transmitter at the webcast center trans- 
mits the Web content in individual packages; and 
each client has an unpackager to reconstruct the Web 
content from the packages, the unpackager being 
2j configured to determine whether the Web content 

received from the webcast center is more recent than 
any Web content from an identical site that may 
already be stored on the cUent. 

10. A system for delivering Web content over a broadcast 
medium, comprising: 

a webcast center to actively gather Web content from sites 
on the Internet by visiting the sites and fetching content 
from the sites, the webcast center storing the gathered 
Web content, the webcast center having a broadcast 
35 transmitter to transmit the gathered Web content over 
the broadcast medium; 

multiple clients equipped with filters to receive a portion 
of the gathered Web content from one or more selected 
sites; and 

40 wherein each client has an annotator configured to anno- 
tate hyperlinks within the Web content to differentiate 
among first hyperlinks that access content not stored 
locally at the client and second hyperlinks that access 
content stored locally at the client. 

45 11. A webcast center comprising: 

a gatherer to actively gather Web content from sites on the 
Internet by visiting the sites and fetching content from 
the sites; 

a content cache to store the Web content actively gathered 

from the sites by the gatherer; 
a packager to package the Web content stored in the 

content cache into broadcast package files; 
a package store to store the broadcast package files prior 
55 to broadcast transmission; 

a broadcast transmitter to segment the package files from 

the package store into individual packages and transmit 

the packages over a broadcast medium; and 
wherein: 

60 the gatherer retrieves the Web content from the sites 
and stores the Web content in the content cache at a 
first data rate; and 
the broadcast transmitter transmits the packages from 
the package store at a second data rate different from 
65 the first data rate. 

12. A webcast center as recited in claim 11, wherein the 
broadcast transmitter continuously transmits the package 
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files in the package store multiple times before the packager page at a root URL (Universal Resource Locator) and any 

updates the package files in the package store. additional Web pages within a predefined depth below the 

13. For use in a system for delivering Web content in root URL. 

individual data packages over a broadcast medium to mul- 18. A system as recited in claim 16, wherein the gatherer 
tiple clients, a package handling unit at each of the clients 5 is configured to retrieve Web pages and any in-line image 

comprising: files referenced by the home Web pages, 

a receiver to reconstruct package files from the data 19. A system as recited in claim 16, wherein: 

packages; the gatherer retrieves the Web content from the sites and 

a package store to temporarily hold the package files stores the Web content in the content cache at a first 

containing the Web content; and ^^^^J 

an unpackager to reconstruct the Web content from the broadcast transmitter transmits the packages from the 

package files, the unpackager being configured to package store at a second data rate different from the 

determine whether the Web content received in the ^^^^ 

package files from the broadcast medium is more recent 20. Asystem as recited in claim 16, wherein the broadcast 

than any duplicative Web content that may already be transmitter continuously transmits the package files in the 

stored at the client as a result of the client retrieving the package store multiple times before the packager updates the 

duplicative Web content from a same Web site from package files in the package store. 

which the Web content in the package files came. 21. A system as recited in claim 16, wherein the webcast 

14. For use in a system for delivering Web content over ^^^^^^ further comprises an announcement generator to send 
a broadcast medium to multiple clients, an annotator pro- announcements informing the client of the Web content to be 
gram executing on each of the clients to annotate hyperlinks transmitted in future broadcast transmissions. 

within the Web content to differentiate among first hyper- 22. A system as recited in claim 16, wherein the client 

links that access content not stored locally at the client and further comprises an annotator to annotate hyperlinks within 

second hyperlinks that access content stored locally at the the preferred Web content, the annotated hyperlinks differ- 

client. entiating between first hyperUnks that access content not 

15. An annotator program as recited in claim 14, further stored locally at the client and second hyperlinks that access 
configured to annotate the hyperiinks to differentiate among content stored locally at the client. 

third hyperiinks that have been actuated at least one by a 23. A method for delivering Web content to clients, 
user at the client. 3^ comprising: 

16. Asystem for delivering Web content over a broadcast actively gathering the Web content from sites on the 
medium, comprising: Internet by visiting the site to retrieve the Web content; 

(A) a webcast center comprising: storing the Web content in a first cache; 

(1) a gatherer to actively gather Web content from sites packaging the Web content from the first cache into 
on the Internet by visiting the sites to retrieve the 35 package files; 

.^x^^*^L^°P\^°*' ^ . , L ^ • . storing the package files in a second cache; 

(2) a scheduler to schedule for the gatherer the sites and .... , ^. ^ . . . 
times to actively gather the Web content; broadcastmg the package files from the second cache over 

(3) a content cache to store the Web content gathered broadcast medium; and 

from the sites by the gatherer; 40 receiving the package files at a client; 

(4) a packager to package the Web content stored in the filtering the package files to retain files containing the 
content cache into broadcast package files; Web content preferred by a user; 

(5) a package store to store the broadcast package files; reconstructing the preferred Web content from the 
and retained package files; and 

(6) a broadcast transmitter to segment the package files 45 annotating hyperlinks within the Web content to differ- 
from the package store into individual packages and entiate among first hyperlinks that access content not 
transmit the packages over a broadcast medium; stored locally at the client and second hyperiinks that 

(B) a client comprising: access content stored locally at the client. 

(1) a receiver to receive the broadcast packages; 24. A method as recited in claim 23, further comprising 

(2) a subscription database to store a directory of the 50 retrieving, at each of the sites, a Web page at a root URL 
Web content gathered by the webcast center; (Universal Resource Locator) and any additional Web pages 

(3) a subscriber user interface executing at the client to within a predetermined depth below the root URL. 
enable a user to select preferred Web content from 25. A method as recited in claim 23, further comprising 
the directory of the subscription database; retrieving Web pages and any in-line image files referenced 

(4) a filter to direct the receiver to accept the packages 55 by the Web pages. 

carrying the preferred Web content selected by the 26. A method as recited in claim 23, further comprising 

user, while rejecting other packages carrying non- scheduling the sites and times to gather the Web content 

preferred Web content, and to reconstruct the pack- from the Internet. 

age files from the accepted packages; 27. A method as recited in claim 23, further comprising 

(5) a client-side package store to temporarily hold the 60 gathering the Web content at a first data rate and broadcast - 
package files received by the receiver; and ing the package files at a second data rate that is different 

(6) an unpackager to determine whether the Web con- from the first data rate. 

tent in the package files received from the broadcast 28. A method as recited in claim 23, further comprising 

medium is more recent than any other version of the examining the preferred Web content after reconstructing to 
Web content that may already be stored at the client. 65 determine whether the Web content is more recent than any 

17. Asystem as recited in claim 16, wherein the gatherer other version of the Web content that may already be stored 
is configurable, for each of the sites, to gather a home Web at the client. 
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29. In a system for delivering Web content over a broad- 
cast medium from a webcast center to multiple clients, a 
computer-implemented method for cumulating Web content 
at the webcast center comprising instructions for: 

gathering the Web content from sites on the Internet by 
visiting the site to retrieve the Web content; 

storing the Web content in a first cache; 

packaging the Web content from the first cache into 
package files; 

storing the package files at a second cache; 

gathering the Web content from the sites; and 

storing the Web content in the content cache at a first data 
rate that is diflerent from and independent of a second 
data rate at which the packages from the package store 
are transmitted over the broadcast medium. 

30. A computer-implemented method as recited in claim 
29, further comprising: 

segmenting the package files from the package store into 

individual packages; and 
transmitting the packages over a broadcast medium. 
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31. A computer-implemented method as recited in claim 
29, further comprising transmitting the packages multiple 
times between updating the Web content in the package files. 

32. In a system for delivering Web content over a broad- 
cast medium from a webcast center to multiple clients, a 
computer-implemented method comprising annotating, at a 
client, hyperlinks within the Web content to differentiate 
among first hyperlinks that access content not stored locally 
at the cHent and second hyperlinks that access content stored 
locally at the client. 

33. A method as recited in claim 32, further comprising 
annotating the hyperlinks to differentiate among third hyper- 
links that have been actuated at least one by a user at the 
client. 

34. A computer-readable medium having computer- 
readable instructions for performing the method recited in 
claim 32. 
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